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□ 1: M33987. Human carbonic an,, 
[gi: 179792] 



Related Sequences. OMIM, Protein, PubMed, Taxonomy, UniSTS, 

LinkOut 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT • 



FEATURES 

source 



gene 

TATA signal 
rtiRNA 

mRNA 

CDS 



HUMCAIX 2785 bp rtiRNA linear PRI 31-OCT-1994 

Human carbonic anhydrase I (CAI) mRNA, complete cds . 

M33987 

M33987.1 01:179792 
carbonic anhydrase I. 

Human EBV transformed SH B cell line DNA, and cDNA to mRNA. 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi.; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. . 
1 (bases 1 to 2785) 

Lowe.N., Brady, H. J., Barlow, J. H., Sowden,J.C., Edwards, M. and 
Butterworth, P.H. 

Structure and methylation patterns of the gene encoding human 

carbonic anhydrase I 

Gene 93 (2), 277-283 (1990) 

91033039 

2121614 

Since no intron sequences were provided this entry is treated as if 
originating from an mRNA. 

Draft entry and computer-readable sequence for [Unpublished (1990)] 

kindly submitted 

by N.Lowe, 09-MAY-1990. 

Author address: N.Lowe 

Department of Biochemistry 

University College London 

Gower Street, London 

WCIE 6BT, U.K. 

E-mail : UCBCMAR%EUCLID . UCL . AC . UK0CUNYVM . CUNY . EDU . 
Locati on/ Quali f ier s 
1.'.2785 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
/map="8ql3-q22" 
1. .2785 
/gene="CAl" 
874. .879 
/gene="CAl" 
902. .1939 
/gene-"CAl " 

/note="carbonic anhydrase I mRNA (alt.)" 
902. .2165 
/gene="CAl" 

/note="carbonic anhydrase I mRNA (alt.)" 
1048. .1833 
/gene="CAl" 

/note="carbonic anhydrase I (EC 4.2.1.1)" 
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/codon_start=l 
/protein_id=='' AAA51910 . 1 " 
/db_xref="GI: 179793" 
/db_xref ="GDB : GOO-119-047 " 

/translation-"MASPDWGYDDKNGPEQWSKLYPIANGNNQSPVDIKTSETKHDTS 
LKPISVSYNPATAKEIIWGHSFHVNFEDNDNRSVLKGGPFSDSYRLFQFHFHWGSTN 
EHGSEHTVDGVKYSAELHVAHWNSAKYSSLAEAASKADGLAVIGVLMKVGEANPKLQK 
VLDALQAIKTKGKRAPFTNFDPSTLLPSSLDFWTYPGSLTHPPLYESVTWIICKESIS 
VSSEQLAQFRSLLSNVEGDNAVPMQHNNRPTQPLKGRTVRASF" 

BASE COUNT 84 0 a 603 c 4 92 g 850 t 

ORIGIN 

1 ctttagccca acagtcaaaa ataattgatg ctaccctaca aatgtccaaa actctagtat 
61 atcatatttc taagttacag caaatattag tcctgctaaa ccagggagct ttggcaaaaa 
121 tgttttttga cagtaaattt gtccttgatt atatattaac tagtcaaaga ggtgtttgta 
181 acattattag agcttcttgt tgtaggtggg ttaacaccac caatcaagag gtcattctaa 
241 cagaaagcct ggatcagaaa accatcaccc taaaaaaaca tgccttacat atttaacaca 
301 ctctgaaatc cagtcaaaat atgactaaag gcccttgcca tgactgatgt attctcctgg 

3 61 ccaacgccaa acaaatggga gcctggttac gagtcagcct tcagggactt gtcacatttc 
421 tacttggttt cttccttgtt attgtcataa taaaatgttt tctatgctgt ttagtgcaac 

4 81 ttaggcccta ttctgtagaa gtctcctcta ctattcaggc cactcaaaca ccccaaataa 
541 ttgagttcaa aatcgacatc aagatataaa ggaatcagtg actaaatata tttcatatat 
601 ggtattttta ttgattattg tgctgtcttg acctagtatg gaggccttgg ctagaggctg 
661 gtcagtttcc tctcttgagc agctgattaa atccacaccc caaccacttc ccttatcagg 
721 ttctcacact ctggggccac tatgtaccca ctctaatcac cacagggcca gacatcagac 
781 aattaaggac agcgcccatg ccccaaagcc cgccaaaatt atgcaaatta ttcaaaatta 
841 ttcaacctag ctaaccccac cctttttgct gtacataagc tgcccattcc ccctccagcc 
901 tgtggtaccc agtcctcagg tgcaaccccc tgcgtggtcc tctgtggcag ccttctctca 
961 ttcagagctg ttttccacag aggtagtgaa aagaactgga ttttcaagtt cactttgcaa 

1021 gagaaaaaga aaactcagta gaagataatg gcaagtccag actggggata . tgatgacaaa 
1081 aatggtcctg aacaatggag caagctgtat cccattgcca atggaaataa ccaatcccct 
1141 gttgatatta aaaccagtga aaccaaacat gacacctctc tgaaacctat tagtgtctcc 
1201 tacaacccag ccacagccaa agaaattatc aatgtggggc attctttcca tgtaaatttt 
1261 gaggacaacg ataaccgatc agtgctgaaa ggtggtcctt tctctgacag ctacaggctc 
1321 tttcagtttc attttcactg gggcagtaca aatgagcatg gttcagaaca tacagtggat 
1381 ggagtcaaat attctgccga gcttcacgta gctcactgga attctgcaaa gtactccagc 
1441 cttgctgaag ctgcctcaaa ggctgatggt ttggcagtta ttggtgtttt gatgaaggtt 
1501 ggtgaggcca acccaaagct gcagaaagta cttgatgccc tccaagcaat taaaaccaag 
1561 ggcaaacgag ccccattcac aaattttgac ccctctactc tccttccttc atccctggat 
1621 ttctggacct accctggctc tctgactcat cctcctcttt atgagagtgt aacttggatc 
1681 atctgtaagg agagcatcag tgtcagctca gagcagctgg cacaattccg cagccttcta 
1741 tcaaatgttg aaggtgataa cgctgtcccc atgcagcaca acaaccgccc aacccaacct 
1801 ctgaagggca gaacagtgag agcttcattt tgatgattct gagaagaaac ttgtccttcc 
1861 tcaagaacac agccctgctt ctgacataat ccagttaaaa taataatttt taagaaataa 
1921 atttatttca atattagcaa gacagcatgc cttcaaatca atctgtaaaa ctaagaaact 
1981 taaattttag ttcttactgc ttaattcaaa taataattag taagctagca aatagtaatc 
2041 tgtaagcata agcttatctt aaattcaagt ttagtttgag gaattcttta aaattacaac 
2101 taagtgattt gtatgtctat ttttttcagt ttatttgaac caataaaata attttatctc 
2161 tttctttctg ttgtgcattc agtttctaaa accattaagt ttctactcca tttacattca 
2221 aaaatcttaa atactttact tgcaagagta ttttgcttca aatacaacaa cctaagagca 
2281 gctggagatg aaatattggg aaattcattt gcttactcct gaagacaaaa atatagctga 
2341 gatgaccact ggatttaata tcgttatgct ggcccaacat tgctaccatt tgtgttgtct 
2401 gtgatcaaaa tgattatctt ttatatagga agatgacgct tctggatatt gctttcactt 
2461 cttctcccca cgttagcaag gacaatgctt ctctgccatt attacaacta gttagtttgc 
2521 atggagaatc tttactttaa aattggaaga aaagtcacaa gtgaatggtt tataaaaatg 
2581 ctaaagaagt cattcttgct tagaatcata tagaaacatc atgcaatctt ttagtcagat 
2641 gtgcgcttca ccttatgcta tttttatctt taattgacac acaataattg tacatgttta 
2701 tggagtatag tgtggtgttt tctgtttgtt tgtttgtttt ttgagacaag gtctcactct 
2761 gccagtcagg gtggagtgcg atggt 
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□ 1: AB00678L Homo sapiens niRNA...[gi:2385453] Related Sequences, OMIM, Protein, Taxonomy, LinkOut 

LOCUS AB006781 1113 bp rtiRNA linear PRI 26-JUL-2001 

DEFINITION Homo sapiens mRNA for galectin-4, complete cds . 

ACCESSION AB006781 

VERSION AB006781.1 GI: 2385453 

KEYWORDS galectin-4 . 

SOURCE Homo sapiens gastric adenocarcinoma cDNA to mRNA, clone : HP0104 9 . 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1113) 
AUTHORS Kato,S. 

TITLE Human galectin-4 full-length cDNA cloned from gastric 

adenocarcinoma 
JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 1113) 
AUTHORS Kato^S. 
TITLE Direct Submission 

JOURNAL Submitted ( 27 -AUG-1997 ) Seishi Kato, Research Institute of National 
Rehabilitation Center for the Disabled, Department of 
Rehabilitation Engineering; 4-1 Namiki, Tokorozawa, Saitama 
359-8555, Japan (E-mail : seishi@rehab . go . jp, 
Tel:042-995-3100(ex.2568) , Fax: 042-995-3132) 
FEATURES Location/Qualifiers 
source 1..1113 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
/clone="HP0104 9" 

/tissue_type="gastric adenocarcinoma" 
CDS ' 57.. 1028 

/codon_start=l 
/product="galectin-4 " 
/protein id=" BAA22165 . 1 " 
/db_xref="GI : 2385454" 

/translation="MAYVPAPGYQPTYNPTLPYYQPIPGGLNVGMSVYIQGVASEHMK 
RFFVNFWGQDPGSDVAFHFNPRFDGWDKWFNTLQGGKWGSEERKRSMPFKKGAAFE 
LVFIVLAEHYKWVNGNPFYEYGHRLPLQMVTHLQVDGDLQLQSINFIGGQPLRPQGP 
PMMPPYPGPGHCHQQLNSLPTMEGPPTFNPPVPYFGRLQGGLTARRTIIIKGYVPPTG 
KSFAINFKVGSSGDIALHINPRMGNGTVVRNSLLNGSWGSEEKKITHNPFGPGQFFDL 
SIRCGLDRFKVYANGQHLFDFAHRLSAFQRVDTLEIQGDVTLSYVQI " 

BASE COUNT 247 a 337 c 290 g 239 t 

ORIGIN 

1 atctcccact cctgcagctc ttctcacagg accagccact agcacagcct cgaacaatag 

61 cctatgtccc cgcaccgggc taccagccca cctacaaccc gacgctgcct tactaccagc 
121 ccatcccggg cgggctcaac gtgggaatgt ctgtttacat ccaaggagtg gccagcgagc 
181 acatgaagcg gttcttcgtg aactttgtgg ttgggcagga tccgggctca gacgtcgcct 
241 tccacttcaa tccgcggttt gacggctggg acaaggtggt cttcaacacg ttgcagggcg 
301 ggaagtgggg cagcgaggag aggaagagga gcatgccctt caaaaagggt gccgcctttg 
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361 agctggtctt catagtcctg gctgagcact acaaggtggt ggtaaatgga aatcccttct 

421 atgagtacgg gcaccggctt cccctacaga tggtcaccca cctgcaagtg gatggggatc 

4 81 tgcaacttca atcaatcaac ttcatcggag gccagcccct ccggccccag ggacccccga 

541 tgatgccacc ttaccctggt cccggacatt gccatcaaca gctgaacagc ctgcccacca 

601 tggaaggacc cccaaccttc aacccgcctg tgccatattt cgggaggctg caaggagggc 

661 tcacagctcg aagaaccatc atcatcaagg gctatgtgcc tcccacaggc aagagctttg 

721 ctatcaactt caaggtgggc tcctcagggg acatagctct gcacattaat ccccgcatgg 

781 gcaacggtac cgtggtccgg aacagccttc • tgaatggctc gtggggatcc gaggagaaga 

841 agatcaccca caacccattt ggtcccggac agttctttga tctgtccatt cgctgtggct 

901 tggatcgctt caaggtttac gccaatggcc agcacctctt tgactttgcc catcgcctct 

961 cggccttcca gagggtggac acattggaaa tccagggtga tgtcaccttg tcctatgtcc 

1021 agatctaatc tattcctggg gccataactc atgggaaaac agaattatcc cctaggactc 

1081 ctttctaagc ccctaataaa atgtctgagg gtg 
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